Dacheng Tao

JD Explore Academy, JD.com, China

Analytic Energy-Guided Policy Optimization for Offline Reinforcement Learning

May 03, 2025
Via arXiv

Adaptively Point-weighting Curriculum Learning

May 03, 2025
Via arXiv

Low-Precision Training of Large Language Models: Methods, Challenges, and Opportunities

May 02, 2025
Via arXiv

AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization

Apr 30, 2025
Via arXiv

Combatting Dimensional Collapse in LLM Pre-Training Data via Diversified File Selection

Apr 29, 2025
Via arXiv

Plasticine: Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning

Apr 24, 2025
Via arXiv

T2VShield: Model-Agnostic Jailbreak Defense for Text-to-Video Models

Apr 22, 2025
Via arXiv

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Apr 22, 2025
Via arXiv

The Other Side of the Coin: Exploring Fairness in Retrieval-Augmented Generation

Apr 19, 2025
Via arXiv

A Physics-guided Multimodal Transformer Path to Weather and Climate Sciences

Apr 19, 2025
Via arXiv